Amendments to the Claims:

This listing of the claims will replace all prior versions, and listings, of claims in the application. 

1. (currently amended) A method for transposing data in a plurality of processing elements, 
comprising:

shifting the data along a plurality of diagonals of the plurality of processing elements 
until [[the processing elements in the diagonal have]] each of said plurality of diagonals has 
received the data held by every other processing element in that diagonal; and 

selecting data as final output data based on a processing element's position. 
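
The two steps recited above can be illustrated with a minimal Python sketch. It is not taken from the application: it assumes a square N x N array, wrap-around (cyclic) connections between edge elements, a shift direction in which each element receives from its neighbour one row down and one column to the left, and a hypothetical function name `transpose_by_diagonal_shift`. Under those assumptions, N-1 shifts let every element see the data of every other element on its diagonal, and the element at row r, column c keeps the value it held after (c - r) MOD N shifts.

```python
def transpose_by_diagonal_shift(grid):
    """Transpose a square grid using cyclic shifts along wrapped diagonals.

    Illustrative only: the shift direction and the selection rule are
    assumptions chosen so that the result is the transpose.
    """
    n = len(grid)
    held = [row[:] for row in grid]
    # received[r][c][s] is the value held by element (r, c) after s shifts.
    received = [[[grid[r][c]] for c in range(n)] for r in range(n)]

    for _ in range(n - 1):
        # Each element takes the value of the neighbour one row down and
        # one column to the left, wrapping around at the array edges.
        held = [[held[(r + 1) % n][(c - 1) % n] for c in range(n)]
                for r in range(n)]
        for r in range(n):
            for c in range(n):
                received[r][c].append(held[r][c])

    # Position-based selection: element (r, c) keeps the value it held
    # after (c - r) mod n shifts, which originated at element (c, r).
    return [[received[r][c][(c - r) % n] for c in range(n)]
            for r in range(n)]


if __name__ == "__main__":
    for row in transpose_by_diagonal_shift([[1, 2, 3], [4, 5, 6], [7, 8, 9]]):
        print(row)
```

In this sketch every element stores all values it receives and picks one at the end; a count-based scheme would instead let each element latch its value at the right shift without storing the others.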

2. (original) The method of claim 1 additionally comprising one of loading an initial count 
into each processing element and calculating an initial count locally based on the processing 
element's location, said selecting being responsive to said initial count. 

3. (original) The method of claim 2 wherein said plurality of processing elements is 
arranged in an array and said initial count is given by one of the following expressions: 

(x + y + 1) MOD (array size) 
(C + R + 1) MOD (array size) 
(C + y + 1) MOD (array size) or 
(x + R + 1) MOD (array size). 
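
As a hedged illustration of the first expression, the following Python sketch tabulates (x + y + 1) MOD (array size) for every element of a square array. The function name `initial_counts` is hypothetical, and the sketch assumes that x and y are zero-based column and row indices; the claims do not fix the origin or orientation of these coordinates.

```python
def initial_counts(n):
    """Return an n x n table of (x + y + 1) mod n.

    Assumption: x is a zero-based column index and y a zero-based row
    index; the table is indexed as counts[y][x].
    """
    return [[(x + y + 1) % n for x in range(n)] for y in range(n)]


if __name__ == "__main__":
    for row in initial_counts(4):
        print(row)
```

Each wrapped diagonal with x + y constant receives a single count value, and elements with x + y + 1 equal to a multiple of the array size receive a count of zero.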

4. (original) The method of claim 2 additionally comprising maintaining a current count in 
each processing element, said current count being responsive to said initial count and the number 
of data shifts performed, said selecting being responsive to said current count. 

5. (original) The method of claim 4 wherein said maintaining a current count includes 
altering said initial count at programmable intervals by a programmable amount. 

6. The method of claim 4 wherein said initial count is decremented in response to said 
shifting of data to produce said current count. 

7. (original) The method of claim 4 wherein said selecting occurs when said current count 
is non-positive. 
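
The count-based selection of claims 2, 4, 6 and 7 can be sketched as follows. This is an illustration under stated assumptions, not the applicant's implementation: x is taken as the column index, y as the row index counted from the bottom row, elements are cyclically connected, each shift moves data up one row and right one column, and an element latches its output only the first time its count becomes non-positive. The name `transpose_with_countdown` is hypothetical.

```python
def transpose_with_countdown(grid):
    """Transpose a square grid using per-element down-counters."""
    n = len(grid)
    # Initial count (x + y + 1) mod n, with x = column index and
    # y = row index counted from the bottom row (assumed convention).
    count = [[(c + (n - 1 - r) + 1) % n for c in range(n)] for r in range(n)]
    held = [row[:] for row in grid]
    out = [row[:] for row in grid]
    done = [[False] * n for _ in range(n)]

    def latch(current):
        # An element selects its output the first time its count is <= 0.
        for r in range(n):
            for c in range(n):
                if not done[r][c] and count[r][c] <= 0:
                    out[r][c] = current[r][c]
                    done[r][c] = True

    latch(held)  # elements whose initial count is zero keep their own data
    for _ in range(n - 1):
        # One diagonal shift: receive from one row down, one column left.
        held = [[held[(r + 1) % n][(c - 1) % n] for c in range(n)]
                for r in range(n)]
        # Decrement every count in response to the shift.
        for r in range(n):
            for c in range(n):
                count[r][c] -= 1
        latch(held)
    return out


if __name__ == "__main__":
    for row in transpose_with_countdown([[1, 2, 3], [4, 5, 6], [7, 8, 9]]):
        print(row)
```

With this convention the initial count reduces to (column - row) MOD N, which is the number of shifts after which the element holds the data of its mirror-image element across the main diagonal.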

8. The method of claim 1 additionally comprising maintaining a local count including 
setting a counter to a first known value, and counting up from said first known value based on 
the number of shifts that have been performed, said selecting occurring when a current count 
equals a target count. 

9. (original) The method of claim 1 wherein said shifting includes a combination of vertical 
and horizontal shifting. 
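
A diagonal shift built from one vertical and one horizontal shift can be sketched in Python as below. The helper names `shift_up`, `shift_right` and `shift_diagonal` are hypothetical, and wrap-around connections at the array edges are an assumption of the sketch.

```python
def shift_up(grid):
    """Each element takes the value from the row below (wrap-around)."""
    n = len(grid)
    return [grid[(r + 1) % n][:] for r in range(n)]


def shift_right(grid):
    """Each element takes the value from the column to its left (wrap-around)."""
    return [[row[(c - 1) % len(row)] for c in range(len(row))] for row in grid]


def shift_diagonal(grid):
    """Each element takes the value from one row down and one column left."""
    n = len(grid)
    return [[grid[(r + 1) % n][(c - 1) % n] for c in range(n)]
            for r in range(n)]


if __name__ == "__main__":
    g = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    # The vertical and horizontal shifts compose to the diagonal shift,
    # in either order.
    print(shift_right(shift_up(g)) == shift_diagonal(g))
    print(shift_up(shift_right(g)) == shift_diagonal(g))
```

The sketch suggests that an array with only row and column connections could realise each diagonal shift in two steps, at the cost of twice as many shift operations.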

10. (original) The method of claim 1 wherein said shifting includes a combination of shifting 
in the x and z directions. 

11. (currently amended) A method for transposing data in an array of processing elements, 
comprising: 

shifting the data along diagonals in the array a number of times equal to N-1 where N 
equals the number of processing elements in a diagonal; and 

outputting data from each processing element as a function of that element's position in a 
diagonal. 

12. (original) The method of claim 11 additionally comprising one of loading an initial count 
into each processing element and calculating an initial count locally based on the processing 
element's position in a diagonal, said outputting being responsive to said initial count. 

13. (original) The method of claim 12 wherein said initial count is given by one of the 
following expressions: 

(x + y + 1) MOD (array size) 
(C + R + 1) MOD (array size) 
(C + y + 1) MOD (array size) or 
(x + R + 1) MOD (array size). 

14. (original) The method of claim 12 additionally comprising maintaining a current count in 
each processing element, said current count being responsive to said initial count and the number 
of data shifts performed, said outputting being responsive to said current count. 

15. (original) The method of claim 14 wherein said maintaining a current count includes 
altering said initial count at programmable intervals by a programmable amount. 

16. (original) The method of claim 14 wherein said initial count is decremented in response 
to said shifting of data to produce said current count. 

17. (original) The method of claim 16 wherein said outputting occurs when said current 
count is non-positive. 

18. (original) The method of claim 12 additionally comprising maintaining a local count 
including setting a counter to a first known value, and counting up from said first known value 
based on the number of shifts that have been performed, said outputting occurring when a current 
count equals a target count. 

19. (original) The method of claim 11 wherein said shifting includes a combination of 
vertical and horizontal shifting. 

20. (original) The method of claim 11 wherein said shifting includes a combination of 
shifting in perpendicular directions. 

21. (currently amended) A method for transposing data in a plurality of processing elements, 
comprising: 

shifting data between processing elements arranged in diagonals; 
setting an initial count in each processing element according to one of the expressions: 
(x + y + 1) MOD (array size) 
(C + R + 1) MOD (array size) 
(C + y + 1) MOD (array size) or 
(x + R + 1) MOD (array size) 
where y and R are numbers indicating a row and a position in the row of a processing 
element and C and x are numbers indicating a column and a position in the column of a 
processing element; 

modifying the initial count by a programmable amount and at programmable intervals to 
produce a current count; and 

selecting output data as a function of said current count. 
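
One possible reading of the modifying step is sketched below in Python. The claim does not specify how the programmable amount and interval combine, so the sketch assumes the count is reduced by `amount` once every `interval` shifts, and that selection occurs at the first shift for which the current count is non-positive. The names `current_count`, `selection_shift`, `amount`, `interval` and `max_shifts` are all hypothetical.

```python
def current_count(initial, shifts, amount=1, interval=1):
    """Current count after a number of shifts.

    Assumption: the initial count is reduced by `amount` once every
    `interval` shifts.
    """
    return initial - amount * (shifts // interval)


def selection_shift(initial, amount=1, interval=1, max_shifts=1000):
    """Return the first shift at which the current count is non-positive.

    Returns None if that does not happen within `max_shifts` shifts.
    """
    for shifts in range(max_shifts + 1):
        if current_count(initial, shifts, amount, interval) <= 0:
            return shifts
    return None


if __name__ == "__main__":
    print(selection_shift(3))                        # count down by 1 per shift
    print(selection_shift(3, amount=2, interval=2))  # down by 2 every 2 shifts
```

With `amount=1` and `interval=1` the sketch reduces to the plain count-down of claim 22, in which an element with initial count k selects its output after k shifts.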

22. (original) The method of claim 21 wherein said modifying includes counting down from 
said initial count. 

23. (original) The method of claim 22 wherein said selecting occurs when said current count 
is a non-positive value. 

24. (original) The method of claim 21 wherein said shifting includes a combination of 
vertical and horizontal shifting. 

25. (original) The method of claim 21 wherein said shifting includes a combination of 
horizontal shifting. 

26. (currently amended) A computer readable memory device carrying an ordered set of 
instructions which, when executed, perform a method comprising: 

shifting the data along a plurality of diagonals of the plurality of processing elements 
until [[the processing elements in the diagonal have]] each of said plurality of diagonals has 
received the data held by every other processing element in that diagonal; 

selecting data as final output data based on a processing element's position. 